Sound-Imitation Word Recognition for Environmental Sounds Disambiguation in Determining Phonemes of Sound-Imitation Words

نویسندگان

  • Kazushi Ishihara
  • Kazunori Komatani
  • Tetsuya Ogata
چکیده

Environmental sounds are very helpful in understanding environmental situations and in telling the approach of danger, and sound-imitation words (sound-related onomatopoeia) are important expressions to inform such sounds in human communication, especially in Japanese language. In this paper, we design a method to recognize sound-imitation words (SIWs) for environmental sounds. Critical issues in recognizing SIW are how to divide an environmental sound into recognition units and how to resolve representation ambiguity of the sounds. To solve these problems, we designed three-stage procedure that transforms environmental sounds into sound-imitation words, and phoneme group expressions that can represent ambiguous sounds. The three-stage procedure is as follows: (1) a whole waveform is divided into some chunks, (2) the chunks are transformed into sound-imitation syllables by phoneme recognition, (3) a sound-imitation word is constructed from sound-imitation syllables according to the requirements of the Japanese language. Ambiguity problem is that an environmental sound is often recognized differently by different listeners even under the same situation.Phoneme group expressions are new phonemes for environmental sounds, and they can express multiple sound-imitation words by one word.We designed two sets of phoneme groups: “a set of basic phoneme group” and “a set of articulation-based phoneme group” to absorb the ambiguity. Based on subjective experiments, the set of basic phoneme groups proved more appropriate to represent environmental sounds than the articulation-based one or a set of normal Japaneses phonemes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Disambiguation in determining phonemes of sound-imitation words for environmental sound recognition

Onomatopoeia, or sound-imitation words (SIWs) are important in informing sound events in human-computer communication. One problem is listener-dependency in recognizing environmental sounds by means of SIWs, that is, different listener hears the same environmental sound as a different SIW even under the same condition. Therefore, the use of usual Japanese phonemes is not adequate to express SIW...

متن کامل

Automatic transformation of environmental sounds into sound-imitation words based on Japanese syllable structure

Sound-imitation words, a sound-related subset of onomatopoeia, are important for computer-human interaction and automatic tagging of sound archives. The main problem of automatic recognition of sound-imitation word is that the literal representation of such words is dependent on listeners and influenced by a particular cultural history. Based on our preliminary experiments of such dependency an...

متن کامل

Structure to speech conversion - speech generation based on infant-like vocal imitation

This paper proposes a new framework of speech generation by imitating “infants’ vocal imitation”. Most of the speech synthesizers take a phoneme sequence as input and generate speech by converting each of the phonemes into a sound sequentially. In other words, they simulate a human process of reading text out. However, infants usually acquire speech generation ability without text or phoneme se...

متن کامل

Imitation games for complex utterances

This paper presents a preliminary experiment in modelling the emergence of repertoires of complex words in a population of agents. Agents that can produce words, which are strings of (abstract) phonemes, try to imitate each other as well as possible. For this they do not only have to learn each other’s words, but also each other’s phonemes. But they cannot know a priori which sounds are actuall...

متن کامل

Optimal event search using a structural cost function - improvement of structure to speech conversion

This paper describes a new and improved method for the framework of structure to speech conversion we previously proposed. Most of the speech synthesizers take a phoneme sequence as input and generate speech by converting each of the phonemes into its corresponding sound. In other words, they simulate a human process of reading text out. However, infants usually acquire speech communication abi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005